36 research outputs found

    A PROCESS FOR ACHIEVING COMPARABLE DATA FROM HETEROGENEOUS DATABASES

    Get PDF
    The current state of health and biomedicine includes an enormity of heterogeneous data ‘silos’, collected for different purposes and represented differently, that are presently impossible to share or analyze in toto. The greatest challenge for large-scale and meaningful analyses of health-related data is to achieve a uniform data representation for data extracted from heterogeneous source representations. Based upon an analysis and categorization of heterogeneities, a process for achieving comparable data content by using a uniform terminological representation is developed. This process addresses the types of representational heterogeneities that commonly arise in healthcare data integration problems. Specifically, this process uses a reference terminology, and associated maps to transform heterogeneous data to a standard representation for comparability and secondary use. The capture of quality and precision of the “maps” between local terms and reference terminology concepts enhances the meaning of the aggregated data, empowering end users with better-informed queries for subsequent analyses. A data integration case study in the domain of pediatric asthma illustrates the development and use of a reference terminology for creating comparable data from heterogeneous source representations. The contribution of this research is a generalized process for the integration of data from heterogeneous source representations, and this process can be applied and extended to other problems where heterogeneous data needs to be merged

    Desiderata for the development of next-generation electronic health record phenotype libraries

    Get PDF
    BackgroundHigh-quality phenotype definitions are desirable to enable the extraction of patient cohorts from large electronic health record repositories and are characterized by properties such as portability, reproducibility, and validity. Phenotype libraries, where definitions are stored, have the potential to contribute significantly to the quality of the definitions they host. In this work, we present a set of desiderata for the design of a next-generation phenotype library that is able to ensure the quality of hosted definitions by combining the functionality currently offered by disparate tooling.MethodsA group of researchers examined work to date on phenotype models, implementation, and validation, as well as contemporary phenotype libraries developed as a part of their own phenomics communities. Existing phenotype frameworks were also examined. This work was translated and refined by all the authors into a set of best practices.ResultsWe present 14 library desiderata that promote high-quality phenotype definitions, in the areas of modelling, logging, validation, and sharing and warehousing.ConclusionsThere are a number of choices to be made when constructing phenotype libraries. Our considerations distil the best practices in the field and include pointers towards their further development to support portable, reproducible, and clinically valid phenotype design. The provision of high-quality phenotype definitions enables electronic health record data to be more effectively used in medical domains

    The Rare Diseases Clinical Research Network Contact Registry Update: Features and Functionality

    No full text
    The Rare Diseases Clinical Research Network (RDCRN) Contact Registry has grown in size and scope since it was first reported in this journal in 2007. In this paper, we reflect on our seven years\u27 experience developing and expanding the RDCRN Contact Registry to include many more rare diseases. We present the functional and data requirements that motivated this registry, and the new features and policies that have been developed since. Given the high costs and long-term commitment required to build patient registries, the RDCRN Contact Registry experience represents a reasonable approach for identifying and cultivating potential research populations, with minimal resources and patient burden. The basic model of a patient-reported registry has not changed since our 2007 report, but the number of diseases has grown from 42 to 201, and the types of information that are exchanged with participants has expanded. A patient-directed information-sharing feature has been added to reduce barriers to communication between investigators and patients affected by rare and genetic diseases. As specific data and research needs arise, the Contact Registry can be leveraged to access needed data or to solicit patients for particular research opportunities. This multiple-disease registry is scalable, expandable, and standards-driven, and has become a model for clinical and translational research across rare and many other diseases

    Clinical research informatics

    No full text
    This book provides foundational coverage of key areas, concepts, constructs, and approaches of medical informatics as it applies to clinical research activities, in both current settings and in light of emerging policies. The field of clinical research is fully characterized (in terms of study design and overarching business processes), and there is emphasis on information management aspects and informatics implications (including needed activities) within various clinical research environments. The purpose of the book is to provide an overview of clinical research (types), activities, and ar
    corecore